Advances in computer vision have largely been driven by deep learning. Image forensics is one of the major application areas of computer vision, and image forgery detection is a sub-field of image forensics in which forgeries can be exposed using Error Level Analysis (ELA). With ELA-processed images as input, forgery detection becomes a binary classification problem that can be addressed with variations of convolutional neural networks. In this paper we perform transfer learning with state-of-the-art image classification models on the ELA-induced CASIA ITDE v.2 dataset. The models used are VGG-19, Inception-V3, ResNet-152-V2, XceptionNet, and EfficientNet-V2L, presented with their respective methodologies and results.
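The ELA preprocessing step can be made concrete with a small sketch. The actual JPEG round-trip (resave at a fixed quality, reload) is assumed to happen elsewhere, e.g. with an imaging library; the code below only shows the difference-and-rescale step on plain grayscale pixel grids, with toy values that are not from the dataset.

```python
# Core of Error Level Analysis (ELA): regions edited after the last JPEG
# save tend to show a larger recompression error than untouched regions.

def error_level(original, recompressed):
    """Per-pixel absolute difference, rescaled so the peak error maps to 255."""
    diffs = [
        [abs(a - b) for a, b in zip(row_o, row_r)]
        for row_o, row_r in zip(original, recompressed)
    ]
    peak = max(max(row) for row in diffs) or 1  # avoid division by zero
    scale = 255 / peak
    return [[int(d * scale) for d in row] for row in diffs]

# Toy 2x3 grayscale images: the last column stands in for a spliced region
# whose recompression error is larger than the rest of the image.
orig = [[100, 100, 100], [100, 100, 100]]
recd = [[ 99, 100,  90], [100,  99,  88]]
ela = error_level(orig, recd)
```

The rescaled difference map is what gets fed to the CNN classifiers as input.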
The regression branch of machine learning focuses purely on the prediction of continuous values. The supervised learning branch offers many regression-based methods, with both parametric and non-parametric learning models. In this paper we aim at a very subtle point related to distance-based regression models. The distance-based model used is the K-Nearest Neighbors regressor, a supervised non-parametric method. The point we seek to demonstrate is the effect of the model's k parameter and the fluctuations it induces in the metrics. The metrics we use are Root Mean Squared Error and the R-squared goodness of fit, with their values plotted against the value of k.
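The effect being studied can be reproduced with a from-scratch KNN regressor and the two metrics named above. The toy data and helper names here are illustrative, not the paper's dataset.

```python
# Minimal K-Nearest Neighbors regressor, used to show how the choice of k
# moves RMSE and the R-squared goodness of fit on held-out points.
import math

def knn_predict(train_x, train_y, x, k):
    """Average the targets of the k training points closest to x."""
    nearest = sorted(zip(train_x, train_y), key=lambda p: abs(p[0] - x))[:k]
    return sum(y for _, y in nearest) / k

def rmse(actual, predicted):
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

def r_squared(actual, predicted):
    mean = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean) ** 2 for a in actual)
    return 1 - ss_res / ss_tot

# Noisy linear toy data; sweep k and watch both metrics fluctuate.
train_x = [1, 2, 3, 4, 5, 6, 7, 8]
train_y = [1.1, 1.9, 3.2, 3.8, 5.1, 6.2, 6.8, 8.1]
test_x = [2.5, 5.5]
test_y = [2.5, 5.5]
for k in (1, 3, 5):
    preds = [knn_predict(train_x, train_y, x, k) for x in test_x]
    print(k, round(rmse(test_y, preds), 3), round(r_squared(test_y, preds), 3))
```

Small k tracks local noise, large k over-smooths; plotting the two metrics over a range of k values makes the trade-off visible.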
Aerial vehicles follow a guided approach based on latitude, longitude, and altitude. This information can be used to compute the maneuvering state of the vehicle along its trajectory. Determining this state is a binary classification problem, and machine learning can be leveraged to solve it. In this paper we propose an approach for deriving and predicting the maneuvering state using linear, distance-metric, discriminant-analysis, and boosted-ensemble supervised learning methods. We provide various metrics in the results section, along with a brief comparison of which algorithms are best suited to predicting the maneuvering state.
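A sketch of the feature-derivation step may help: from a sequence of (latitude, longitude, altitude) fixes one can compute heading change and climb, and flag a fix as maneuvering when either exceeds a threshold. The thresholds and the rule itself are illustrative stand-ins for the supervised models the paper actually compares, and the flat-earth bearing is a simplification.

```python
# Derive per-fix maneuver indicators from a (lat, lon, alt) trajectory.
import math

def heading(p, q):
    """Approximate bearing in degrees from fix p to fix q on a small flat-earth patch."""
    dlat, dlon = q[0] - p[0], q[1] - p[1]
    return math.degrees(math.atan2(dlon, dlat)) % 360

def maneuver_states(track, turn_thresh=10.0, climb_thresh=50.0):
    """Binary state per interior fix: 1 = maneuvering, 0 = steady flight."""
    states = []
    for i in range(1, len(track) - 1):
        turn = abs(heading(track[i], track[i + 1]) - heading(track[i - 1], track[i]))
        turn = min(turn, 360 - turn)            # handle wrap-around at 0/360
        climb = abs(track[i + 1][2] - track[i][2])
        states.append(1 if turn > turn_thresh or climb > climb_thresh else 0)
    return states

# Straight, level segment followed by a sharp climbing turn.
track = [(0.0, 0.0, 1000), (0.1, 0.0, 1000), (0.2, 0.0, 1000),
         (0.25, 0.1, 1200)]
states = maneuver_states(track)
```

In a learned setup, the derived (turn, climb) pairs would be the feature vectors handed to the classifiers rather than thresholded directly.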
Many prognostic methods have been formulated for the early detection of Polycystic Ovary Syndrome (PCOS) using machine learning. PCOS diagnosis is a binary classification problem. Dimensionality reduction increasingly affects the performance of machine learning models, and supervised dimensionality-reduction methods can give us a new edge in solving this problem. In this paper we present discriminant analysis in its linear and quadratic forms at different dimensions, together with binary classification and its metrics. Compared with many commonly used classification algorithms, the variations of discriminant analysis achieve good accuracy, reaching a test accuracy of 95.92% with quadratic discriminant analysis. The paper also presents visualizations of the data for a deeper understanding of the problem.
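The quadratic form of discriminant analysis can be sketched in its simplest one-feature version: fit a Gaussian per class, each with its own variance (which is what makes the decision boundary quadratic rather than linear), and classify by the larger log-likelihood plus log-prior. The feature values below are synthetic, not the paper's PCOS data.

```python
# One-dimensional quadratic discriminant analysis (QDA) from scratch.
import math

def fit_gaussian(values):
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, var

def log_likelihood(x, mean, var):
    return -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)

def qda_classify(x, class0, class1, prior0=0.5, prior1=0.5):
    s0 = log_likelihood(x, *class0) + math.log(prior0)
    s1 = log_likelihood(x, *class1) + math.log(prior1)
    return 0 if s0 >= s1 else 1

# Two overlapping classes with clearly unequal variances.
neg = fit_gaussian([1.0, 1.2, 0.8, 1.1, 0.9])
pos = fit_gaussian([2.5, 3.5, 2.0, 4.0, 3.0])
```

Linear discriminant analysis is the special case where both classes share one pooled variance; dropping that constraint is exactly what the quadratic variant adds.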
Prognosis from clinical data with stellar accuracy using machine learning techniques is one of the most important real-world challenges today. Polycystic Ovary Syndrome (PCOS) is a medical condition emerging as a problem among women aged 15 to 49. In this paper we propose diagnosing this disease using various boosting ensemble methods. We present a detailed yet simplified comparison between Adaptive Boosting, Gradient Boosting Machine, XGBoost, and CatBoost, with their respective performance metrics, highlighting hidden anomalies in the data and their influence on the results. Metrics such as the confusion matrix, precision, recall, F1 score, FPR, the ROC curve, and AUC are used throughout the paper.
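The reweighting idea behind all four compared libraries can be shown with a minimal Adaptive Boosting loop over one-feature decision stumps: each round upweights the samples the previous stump got wrong. Gradient boosting, XGBoost, and CatBoost differ mainly in the loss they fit and their engineering, not in this ensemble mechanic. The data is a toy 1-D sample, not the PCOS dataset.

```python
# Minimal AdaBoost with decision stumps on labels in {-1, +1}.
import math

def best_stump(xs, ys, weights):
    """Pick the threshold/polarity pair with the lowest weighted error."""
    best = None
    for thresh in sorted(set(xs)):
        for polarity in (1, -1):
            preds = [polarity if x >= thresh else -polarity for x in xs]
            err = sum(w for p, y, w in zip(preds, ys, weights) if p != y)
            if best is None or err < best[0]:
                best = (err, thresh, polarity)
    return best

def adaboost(xs, ys, rounds=5):
    n = len(xs)
    weights = [1 / n] * n
    stumps = []
    for _ in range(rounds):
        err, thresh, polarity = best_stump(xs, ys, weights)
        err = max(err, 1e-10)
        alpha = 0.5 * math.log((1 - err) / err)   # stump's vote strength
        stumps.append((alpha, thresh, polarity))
        for i in range(n):                        # upweight the mistakes
            pred = polarity if xs[i] >= thresh else -polarity
            weights[i] *= math.exp(-alpha * ys[i] * pred)
        total = sum(weights)
        weights = [w / total for w in weights]
    return stumps

def predict(stumps, x):
    score = sum(a * (p if x >= t else -p) for a, t, p in stumps)
    return 1 if score >= 0 else -1

xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [-1, -1, -1, 1, -1, 1, 1, 1]   # one noisy label at x = 5
model = adaboost(xs, ys)
```

The noisy point at x = 5 is the kind of hidden anomaly the paper highlights: boosting keeps upweighting it, which can distort later rounds.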
The United States coastline spans 95,471 miles, a distance that cannot be effectively patrolled or secured by manual human effort alone. Unmanned Aerial Vehicles (UAVs) equipped with infrared cameras and deep-learning based algorithms represent a more efficient alternative for identifying and segmenting objects of interest, namely ships. However, standard approaches to training these algorithms require large-scale datasets of densely labeled infrared maritime images. Such datasets are not publicly available, and manually annotating every pixel in a large-scale dataset would carry an extreme labor cost. In this work we demonstrate that, in the context of segmenting ships in infrared imagery, weakly supervising an algorithm with sparsely labeled data can drastically reduce data labeling costs with minimal impact on system performance. We apply weakly-supervised learning to an unlabeled dataset of 7055 infrared images sourced from the Naval Air Warfare Center Aircraft Division (NAWCAD). We find that by sparsely labeling only 32 points per image, weakly-supervised segmentation models can still effectively detect and segment ships, with a Jaccard score of up to 0.756.
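The core of the weak supervision can be sketched as a loss that is evaluated only at the handful of labeled points, ignoring every unlabeled pixel. The grid shapes and the per-point binary cross-entropy below are illustrative; the paper's actual network and training setup are not reproduced here.

```python
# Loss over sparsely labeled points: unlabeled pixels contribute nothing.
import math

def sparse_point_loss(probs, points):
    """Mean binary cross-entropy over labeled points.

    probs  -- 2-D grid of predicted ship probabilities
    points -- list of (row, col, label) with label 1 = ship, 0 = background
    """
    eps = 1e-7
    total = 0.0
    for r, c, y in points:
        p = min(max(probs[r][c], eps), 1 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(points)

# Confident, correct predictions at the labeled points give a small loss;
# flipping one label makes it large.
probs = [[0.9, 0.1], [0.1, 0.9]]
good = sparse_point_loss(probs, [(0, 0, 1), (0, 1, 0), (1, 1, 1)])
bad = sparse_point_loss(probs, [(0, 0, 0), (0, 1, 0), (1, 1, 1)])
```

With 32 such points per image instead of a dense mask, annotation cost drops by orders of magnitude while the gradient signal at the labeled points remains intact.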
The paper presents a cross-domain review analysis on four popular review datasets: Amazon, Yelp, Steam, IMDb. The analysis is performed using Hadoop and Spark, which allows for efficient and scalable processing of large datasets. By examining close to 12 million reviews from these four online forums, we hope to uncover interesting trends in sales and customer sentiment over the years. Our analysis will include a study of the number of reviews and their distribution over time, as well as an examination of the relationship between various review attributes such as upvotes, creation time, rating, and sentiment. By comparing the reviews across different domains, we hope to gain insight into the factors that drive customer satisfaction and engagement in different product categories.
Visual language such as charts and plots is ubiquitous in the human world. Comprehending plots and charts requires strong reasoning skills. Prior state-of-the-art (SOTA) models require at least tens of thousands of training examples, and their reasoning capabilities are still quite limited, especially on complex human-written queries. This paper presents the first one-shot solution to visual language reasoning. We decompose the challenge of visual language reasoning into two steps: (1) plot-to-text translation, and (2) reasoning over the translated text. The key to this method is a modality conversion module, named DePlot, which translates the image of a plot or chart into a linearized table. The output of DePlot can then be directly used to prompt a pretrained large language model (LLM), exploiting the few-shot reasoning capabilities of LLMs. To obtain DePlot, we standardize the plot-to-table task by establishing unified task formats and metrics, and train DePlot end-to-end on this task. DePlot can then be used off-the-shelf together with LLMs in a plug-and-play fashion. Compared with a SOTA model finetuned on more than 28k data points, DePlot+LLM with just one-shot prompting achieves a 24.0% improvement over the finetuned SOTA on human-written queries from the task of chart QA.
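The plug-and-play step after DePlot can be sketched as pure prompt assembly: the chart has already been translated into a linearized table, and the table plus one worked example is handed to an LLM. The table-serialization format (header row with " | "-separated cells) and the prompt wording below are assumptions for illustration, not DePlot's exact output.

```python
# Build a one-shot table-QA prompt from a linearized table.

def linearize_table(header, rows):
    lines = [" | ".join(header)]
    lines += [" | ".join(str(c) for c in row) for row in rows]
    return "\n".join(lines)

def build_prompt(example, table, question):
    return (
        "Read the table and answer the question.\n\n"
        f"{example}\n\n"                     # one worked example = one-shot
        f"Table:\n{table}\nQ: {question}\nA:"
    )

table = linearize_table(["Year", "Revenue"], [[2021, 10], [2022, 14]])
prompt = build_prompt(
    "Table:\nYear | Users\n2020 | 5\n2021 | 8\nQ: How many users in 2021?\nA: 8",
    table,
    "By how much did revenue grow from 2021 to 2022?",
)
```

Because the reasoning happens entirely in the LLM over this text, the conversion module and the reasoner can be swapped independently, which is what makes the pipeline plug-and-play.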
Automated offensive language detection is essential in combating the spread of hate speech, particularly in social media. This paper describes our work on offensive language identification in the low-resource Indic language Marathi. The problem is formulated as a text classification task to identify a tweet as offensive or non-offensive. We evaluate different monolingual and multilingual BERT models on this classification task, focusing on BERT models pre-trained with social media datasets. We compare the performance of MuRIL, MahaTweetBERT, MahaTweetBERT-Hateful, and MahaBERT on the HASOC 2022 test set. We also explore external data augmentation from the existing Marathi hate speech corpora HASOC 2021 and L3Cube-MahaHate. MahaTweetBERT, a BERT model pre-trained on Marathi tweets, outperforms all other models with an F1 score of 98.43 on the HASOC 2022 test set when fine-tuned on the combined dataset (HASOC 2021 + HASOC 2022 + MahaHate). With this, we also provide a new state-of-the-art result on the HASOC 2022 / MOLD v2 test set.
Free-text rationales (FTRs) follow how humans communicate, explaining reasoning processes via natural language. A number of recent works have studied how to improve language model (LM) generalization by using FTRs to teach LMs the correct reasoning processes behind correct task outputs. These prior works aim to learn from FTRs by appending them to the LM input or target output, but this may introduce an input distribution shift or a conflict with the task objective, respectively. We propose KNIFE, which distills FTR knowledge from an FTR-augmented teacher LM (which takes both the task input and an FTR) into a student LM (which takes only the task input) that is then used for inference. Crucially, the teacher LM's forward computation has a bottleneck stage in which all of its FTR states are masked out, which pushes knowledge from the FTR states into the task input/output states. FTR knowledge is then distilled into the student LM by training its task input/output states to align with the teacher LM's. On two question answering datasets, we show that KNIFE significantly outperforms existing FTR learning methods, in both fully-supervised and low-resource settings.
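The distillation objective can be sketched as a state-alignment loss in which the teacher's FTR positions are excluded from the alignment entirely. The plain lists of floats below stand in for LM hidden vectors, and the masking-by-position scheme is an illustrative reading of the method, not the authors' code.

```python
# Distill teacher task input/output states into a student, skipping FTR states.

def distill_loss(teacher_states, student_states, ftr_positions):
    """Mean squared error over non-FTR positions only."""
    total, count = 0.0, 0
    for i, (t_vec, s_vec) in enumerate(zip(teacher_states, student_states)):
        if i in ftr_positions:
            continue                      # FTR states never supervise directly
        total += sum((t - s) ** 2 for t, s in zip(t_vec, s_vec))
        count += len(t_vec)
    return total / count

# Positions 2 and 3 hold FTR tokens on the teacher side; a large mismatch
# there contributes nothing to the loss.
teacher = [[1.0, 0.0], [0.5, 0.5], [9.0, 9.0], [9.0, 9.0], [0.0, 1.0]]
student = [[1.0, 0.0], [0.5, 0.5], [0.0, 0.0], [0.0, 0.0], [0.0, 0.5]]
loss = distill_loss(teacher, student, {2, 3})
```

Because the bottleneck already forced FTR knowledge into the teacher's task states, aligning only those states is enough to transfer it to a student that never sees an FTR.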